    • A Fast Solver for Large Tridiagonal Systems on Multi-Core Processors (Lass Library)

      Valero-Lara, Pedro; Andrade, Diego; Sirvent, Raül; Labarta, Jesús; Fraguela, Basilio B.; Doallo, Ramón (Institute of Electrical and Electronics Engineers, 2019)
      [Abstract] Many problems of industrial and scientific interest require the solving of tridiagonal linear systems. This paper presents several implementations for the parallel solving of large tridiagonal systems on ...
    • A framework for argument-based task synchronization with automatic detection of dependencies 

      González, Carlos H.; Fraguela, Basilio B. (Elsevier, 2013)
      [Abstract] Synchronization in parallel applications can be achieved either implicitly or explicitly. Implicit synchronization is typical of programming environments that provide predefined, and often simple, patterns of ...
    • A general and efficient divide-and-conquer algorithm framework for multi-core clusters 

      González, Carlos H.; Fraguela, Basilio B. (SpringerLink, 2017)
      [Abstract] Divide-and-conquer is one of the most important patterns of parallelism, being applicable to a large variety of problems. In addition, the most powerful parallel systems available nowadays are computer clusters ...
    • A Highly Optimized Skeleton for Unbalanced and Deep Divide-And-Conquer Algorithms on Multi-Core Clusters 

      Álvarez Martínez, Millán; Fraguela, Basilio B.; Cabaleiro, José Carlos (Springer, 2022)
      [Abstract] Efficiently implementing the divide-and-conquer pattern of parallelism in distributed memory systems is very relevant, given its ubiquity, and difficult, given its recursive nature and the need to exchange tasks ...
    • A multi-GPU shallow-water simulation with transport of contaminants 

      Viñas Buceta, Moisés; Lobeiras Blanco, Jacobo; Fraguela, Basilio B.; Arenaz Silva, Manuel; Amor, Margarita; García Rodríguez, José Antonio; Castro, M.J.; Doallo, Ramón (Wiley, 2012)
      [Abstract] This work presents cost-effective multi-graphics processing unit (GPU) parallel implementations of a finite-volume numerical scheme for solving pollutant transport problems in bidimensional domains. The fluid ...
    • A new thread-level speculative automatic parallelization model and library based on duplicate code execution 

      Álvarez Martínez, Millán; Fraguela, Basilio B.; Cabaleiro, José Carlos; Rivera, Francisco F. (Springer Nature, 2024-03-11)
      [Abstract] Loop-efficient automatic parallelization has become increasingly relevant due to the growing number of cores in current processors and the programming effort needed to parallelize codes in these systems efficiently. However, ...
    • A Parallel Skeleton for Divide-and-conquer Unbalanced and Deep Problems 

      Álvarez Martínez, Millán; Fraguela, Basilio B.; Cabaleiro, José Carlos (Springer Nature, 2021)
      [Abstract] The Divide-and-conquer (D&C) pattern appears in a large number of problems and is highly suitable to exploit parallelism. This has led to much research on its easy and efficient application both in shared and ...
    • A Portable and Adaptable Fault Tolerance Solution for Heterogeneous Applications 

      Losada, Nuria; Fraguela, Basilio B.; González, Patricia; Martín, María J. (Academic Press, 2017-06)
      [Abstract] Heterogeneous systems have increased their popularity in recent years due to the high performance and reduced energy consumption capabilities provided by using devices such as GPUs or Xeon Phi accelerators. This ...
    • A Software Cache Autotuning Strategy for Dataflow Computing with UPC++ DepSpawn 

      Fraguela, Basilio B.; Andrade, Diego (Wiley, 2021)
      [Abstract] Dataflow computing allows to start computations as soon as all their dependencies are satisfied. This is particularly useful in applications with irregular or complex patterns of dependencies which would otherwise ...
    • Accelerating the HyperLogLog Cardinality Estimation Algorithm 

      Bozkus, Cem; Fraguela, Basilio B. (Hindawi, 2017)
      [Abstract] In recent years, vast amounts of data of different kinds, from pictures and videos from our cameras to software logs from sensor networks and Internet routers operating day and night, are being generated. This ...
    • An automatic optimizer for heterogeneous devices 

      Fernández-Fabeiro, Jorge; Andrade, Diego; Fraguela, Basilio B.; Doallo, Ramón (Elsevier, 2020-05)
      [Abstract] Codes written in a naive way seldom effectively exploit the computing resources, while writing optimized codes is usually a complex task that requires certain levels of expertise. This problem is further increased ...
    • Analysis of interval‐grouped data in weed science: The binnednp Rcpp package 

      Barreiro-Ures, Daniel; Francisco-Fernández, Mario; Cao, Ricardo; Fraguela, Basilio B.; Doallo, Ramón; González-Andújar, José Luis; Reyes, Miguel (John Wiley & Sons Ltd., 2019-09-13)
      [Abstract] Weed scientists are usually interested in the study of the distribution and density functions of the random variable that relates weed emergence with environmental indices like the hydrothermal time (HTT). ...
    • Automated and accurate cache behavior analysis for codes with irregular access patterns 

      Andrade, Diego; Arenaz Silva, Manuel; Fraguela, Basilio B.; Touriño, Juan; Doallo, Ramón (John Wiley & Sons Ltd., 2007-04-03)
      [Abstract] The memory hierarchy plays an essential role in the performance of current computers, so good analysis tools that help in predicting and understanding its behavior are required. Analytical modeling is the ideal ...
    • Automatic mapping of parallel applications on multicore architectures using the Servet benchmark suite 

      González-Domínguez, Jorge; Taboada, Guillermo L.; Fraguela, Basilio B.; Martín, María J.; Touriño, Juan (Pergamon Press, 2012-03)
      [Abstract] Servet is a suite of benchmarks focused on detecting a set of parameters with high influence on the overall performance of multicore systems. These parameters can be used for autotuning codes to increase their ...
    • Developing adaptive multi-device applications with the Heterogeneous Programming Library 

      Viñas Buceta, Moisés; Bozkus, Zeki; Fraguela, Basilio B.; Andrade, Diego; Doallo, Ramón (Springer, 2015)
      [Abstract] The usage of heterogeneous devices presents two main problems. One is their complex programming, a problem that grows when multiple devices are used. The second issue is that even if the codes for these devices ...
    • Easy Dataflow Programming in Clusters with UPC++ DepSpawn 

      Fraguela, Basilio B.; Andrade, Diego (Institute of Electrical and Electronics Engineers, 2019-06-01)
      [Abstract] The Partitioned Global Address Space (PGAS) programming model is one of the most relevant proposals to improve the ability of developers to exploit distributed memory systems. However, despite its important ...
    • Enhanced global optimization methods applied to complex fisheries stock assessment models 

      Penas, David R.; Gómez, Andrés; Fraguela, Basilio B.; Martín, María J.; Cerviño, Santiago (Elsevier BV, 2019-04)
      [Abstract] Statistical fisheries models are frequently used by researchers and agencies to understand the behavior of marine ecosystems or to estimate the maximum acceptable catch of different species of commercial interest. ...
    • Exploiting Heterogeneous Parallelism With the Heterogeneous Programming Library 

      Viñas Buceta, Moisés; Bozkus, Zeki; Fraguela, Basilio B. (Elsevier, 2013)
      [Abstract] While recognition of the advantages of heterogeneous computing is steadily growing, the issues of programmability and portability hinder its exploitation. The introduction of the OpenCL standard was a major step ...
    • Facilitating the development of stencil applications using the Heterogeneous Programming Library 

      Viñas Buceta, Moisés; Fraguela, Basilio B.; Andrade, Diego; Doallo, Ramón (2017)
      [Abstract] Stencil computations are very common in scientific codes. Heterogeneous systems achieve good results solving these problems, but their programming is complex because of the ghost regions required in multi-device ...
    • GPU Accelerated Molecular Docking Simulation with Genetic Algorithms 

      Altuntas, Serkan; Bozkus, Zeki; Fraguela, Basilio B. (Springer, Cham, 2016)
      [Abstract] Receptor-Ligand Molecular Docking is a very computationally expensive process used to predict possible drug candidates for many diseases. A faster docking technique would help life scientists to discover better ...